Smart Paper Notes

The SIGMORPHON 2022 Shared Task on Morpheme Segmentation

Khuyagbaatar Batsuren , Gábor Bella , Aryaman Arora , Viktor Martinović , Kyle Gorman , Zdeněk Žabokrtský , Amarsanaa Ganbold , Šárka Dohnalová , Magda Ševčíková , Kateřina Pelegrinová

Category: Natural Language Processing

2022-06-15

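The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to decompose a word into a sequence of morphemes and covered most types of morphology: compounds, derivations, and inflections. Subtask 1, word-level morpheme segmentation, covered 5 million words in 9 languages (Czech, English, Spanish, Hungarian, French, Italian, Russian, Latin, Mongolian) and received 13 system submissions from 7 teams; the best system averaged a 97.29% F1 score across all languages, ranging from English (93.84%) to Latin (99.38%). Subtask 2, sentence-level morpheme segmentation, covered 18,735 sentences in 3 languages (Czech, English, Mongolian) and received 10 system submissions from 3 teams; the best systems outperformed all three state-of-the-art subword tokenization methods (BPE, ULM, Morfessor2) by 30.71% absolute. To facilitate error analysis and support any type of future studies, we released all system predictions, the evaluation script, and all gold-standard datasets.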
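
To make the reported F1 scores concrete, the following is a minimal sketch of a morpheme-level F1 computation. It assumes that morphemes are compared per word as multisets and micro-averaged over the dataset; the function name `morpheme_f1` and the toy data are hypothetical, and the shared task's released evaluation script may define the metric differently.

```python
from collections import Counter


def morpheme_f1(gold, pred):
    """Micro-averaged precision, recall and F1 over morphemes.

    gold, pred: parallel lists of segmentations, where each segmentation
    is a list of morpheme strings for one word.
    """
    true_pos = n_pred = n_gold = 0
    for gold_morphs, pred_morphs in zip(gold, pred):
        # Multiset intersection: a morpheme counts at most as often
        # as it occurs in both the gold and the predicted segmentation.
        overlap = Counter(gold_morphs) & Counter(pred_morphs)
        true_pos += sum(overlap.values())
        n_pred += len(pred_morphs)
        n_gold += len(gold_morphs)
    precision = true_pos / n_pred if n_pred else 0.0
    recall = true_pos / n_gold if n_gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical toy data: the first word is under-segmented.
gold = [["un", "break", "able"], ["dog", "s"]]
pred = [["un", "breakable"], ["dog", "s"]]
precision, recall, f1 = morpheme_f1(gold, pred)
```

In this toy case three of the four predicted morphemes are correct and three of the five gold morphemes are recovered, so under-segmentation costs recall more than precision.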

UniMorph 4.0: Universal Morphology

Khuyagbaatar Batsuren , Omer Goldman , Salam Khalifa , Nizar Habash , Witold Kieraś , Gábor Bella , Brian Leonard , Garrett Nicolai , Kyle Gorman , Yustinus Ghanggo Ate

Category: Natural Language Processing

2022-05-07

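The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage, instantiated, normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation, and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, such as missing gender and macron information. We have also amended the schema to use a hierarchical structure, which is needed for morphological phenomena such as multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards the inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.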

Cell-Free Data Power Control Via Scalable Multi-Objective Bayesian Optimisation

Sergey S. Tambovskiy , Gábor Fodor , Hugo Tullberg

Category: Machine Learning | Machine Learning (Statistics)

2022-12-20

Cell-free multi-user multiple input multiple output networks are a promising alternative to classical cellular architectures, since they have the potential to provide uniform service quality and high resource utilisation over the entire coverage area of the network. To realise this potential, previous works have developed radio resource management mechanisms using various optimisation engines. In this work, we consider the problem of overall ergodic spectral efficiency maximisation in the context of uplink-downlink data power control in cell-free networks. To solve this problem in large networks, and to address convergence-time limitations, we apply scalable multi-objective Bayesian optimisation. Furthermore, we discuss how an intersection of multi-fidelity emulation and Bayesian optimisation can improve radio resource management in cell-free networks.

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Category: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

Hyperactive Learning (HAL) for Data-Driven Interatomic Potentials

Cas van der Oord , Matthias Sachs , Dávid Péter Kovács , Christoph Ortner , Gábor Csányi

Category: Machine Learning (Statistics)

2022-10-09

Data-driven interatomic potentials have emerged as a powerful class of surrogate models for ab initio potential energy surfaces that are able to reliably predict macroscopic properties with experimental accuracy. In generating accurate and transferable potentials the most time-consuming and arguably most important task is generating the training set, which still requires significant expert user input. To accelerate this process, this work presents hyperactive learning (HAL), a framework for formulating an accelerated sampling algorithm specifically for the task of training database generation. The key idea is to start from a physically motivated sampler (e.g., molecular dynamics) and add a biasing term that drives the system towards high uncertainty and thus to unseen training configurations. Building on this framework, general protocols for building training databases for alloys and polymers leveraging the HAL framework will be presented. For alloys, ACE potentials for AlSi10 are created by fitting to a minimal HAL-generated database containing 88 configurations (32 atoms each) with fast evaluation times of <100 microsecond/atom/cpu-core. These potentials are demonstrated to predict the melting temperature with excellent accuracy. For polymers, a HAL database is built using ACE, able to determine the density of a long polyethylene glycol (PEG) polymer formed of 200 monomer units with experimental accuracy by only fitting to small isolated PEG polymers with sizes ranging from 2 to 32.

Tensor-reduced atomic density representations

James P. Darby , Dávid P. Kovács , Ilyes Batatia , Miguel A. Caro , Gus L. W. Hart , Christoph Ortner , Gábor Csányi

Category: Machine Learning

2022-10-02

Density based representations of atomic environments that are invariant under Euclidean symmetries have become a widely used tool in the machine learning of interatomic potentials, broader data-driven atomistic modelling and the visualisation and analysis of materials datasets. The standard mechanism used to incorporate chemical element information is to create separate densities for each element and form tensor products between them. This leads to a steep scaling in the size of the representation as the number of elements increases. Graph neural networks, which do not explicitly use density representations, escape this scaling by mapping the chemical element information into a fixed dimensional space in a learnable way. We recast this approach as tensor factorisation by exploiting the tensor structure of standard neighbour density based descriptors. In doing so, we form compact tensor-reduced representations whose size does not depend on the number of chemical elements, but remain systematically convergeable and are therefore applicable to a wide range of data analysis and regression tasks.

Two-Tailed Averaging: Anytime Adaptive Once-in-a-while Optimal Iterate Averaging for Stochastic Optimization

Gábor Melis

Category: Machine Learning (Statistics) | Natural Language Processing | Machine Learning

2022-09-26

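Tail averaging improves on Polyak averaging's non-asymptotic behaviour by excluding a number of leading iterates of stochastic optimization from its calculations. In practice, with a finite number of optimization steps and a learning rate that cannot be annealed to zero, tail averaging can get much closer to a local minimum point of the training loss than either the individual iterates or the Polyak average. However, the number of leading iterates to ignore is an important hyperparameter, and starting averaging too early or too late leads to inefficient use of resources or suboptimal solutions. Setting this hyperparameter to improve generalization is even more difficult, especially in the presence of other hyperparameters and overfitting. Furthermore, before averaging starts, the loss is only weakly informative of the final performance, which makes early stopping unreliable. To alleviate these problems, we propose an anytime variant of tail averaging that has no hyperparameters and approximates the optimal tail at all optimization steps. Our algorithm is based on two running averages with adaptive lengths bounded in terms of the optimal tail length, one of which achieves approximate optimality with some regularity. Requiring only additional storage for two sets of weights and periodic evaluation of the loss, the proposed two-tailed averaging algorithm is a practical and widely applicable method for improving stochastic optimization.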
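
The difference between Polyak averaging and plain tail averaging can be illustrated with a small sketch. This is not the proposed two-tailed algorithm, only the baseline it builds on. The toy objective f(w) = w^2 / 2, the constant learning rate, the alternating perturbation standing in for gradient noise, and the names `polyak_average` and `tail_average` are all assumptions made for illustration.

```python
def polyak_average(iterates):
    """Average of all iterates, including the early transient."""
    return sum(iterates) / len(iterates)


def tail_average(iterates, skip):
    """Average of the iterates after dropping the first `skip` of them."""
    tail = iterates[skip:]
    if not tail:
        raise ValueError("skip must be smaller than the number of iterates")
    return sum(tail) / len(tail)


# Toy run: gradient descent on f(w) = w^2 / 2 (minimum at w = 0) with a
# constant learning rate and a deterministic +1/-1 perturbation of the
# gradient as a stand-in for stochastic gradient noise.
w = 10.0
learning_rate = 0.1
iterates = []
for step in range(200):
    perturbation = 1.0 if step % 2 == 0 else -1.0
    gradient = w + perturbation
    w -= learning_rate * gradient
    iterates.append(w)

polyak = polyak_average(iterates)
tail = tail_average(iterates, skip=100)
```

Here the Polyak average is pulled away from the minimum by the early iterates, while the tail average over the last 100 iterates lands much closer to it. The `skip` argument is exactly the hyperparameter the abstract describes as hard to set.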

Contour Dice loss for structures with Fuzzy and Complex Boundaries in Fetal MRI

Bella Specktor Fadida , Bossmat Yehuda , Daphna Link Sourani , Liat Ben Sira , Dafna Ben Bashat , Leo Joskowicz

Category: Computer Vision

2022-09-25

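Volumetric measurements of fetal structures in MRI are time-consuming and error-prone and therefore require automatic segmentation. Placenta segmentation and accurate fetal brain segmentation for gyrification assessment are particularly challenging because of the fuzzy boundaries of the placenta and the complex foldings of the fetal brain cortex. In this paper, we study the use of the Contour Dice loss for both problems and compare it to other boundary losses and to the combined Dice and Cross-Entropy loss. The loss is computed efficiently for each slice via erosion, dilation and XOR operators. We describe a new formulation of the loss akin to the Contour Dice metric. The combination of the Dice loss and the Contour Dice loss yielded the best performance for placenta segmentation. For fetal brain segmentation, the best-performing loss was the combined Dice and Cross-Entropy loss, followed by the Dice with Contour Dice loss, which performed better than the other boundary losses.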
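
The per-slice use of erosion, dilation and XOR can be sketched as follows. This is a simplified illustration on hard binary masks, not the paper's loss formulation: it assumes a 3x3 cross structuring element, treats pixels outside the image as background, and scores the Dice overlap of the two contour bands. All function names are hypothetical.

```python
def _neighbourhood(mask, i, j):
    """Values of pixel (i, j) and its 4 neighbours; out-of-image counts as 0."""
    height, width = len(mask), len(mask[0])
    cells = [(i, j), (i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)]
    return [mask[r][c] if 0 <= r < height and 0 <= c < width else 0
            for r, c in cells]


def erode(mask):
    """Keep a pixel only if it and all 4 neighbours are foreground."""
    return [[int(all(_neighbourhood(mask, i, j))) for j in range(len(mask[0]))]
            for i in range(len(mask))]


def dilate(mask):
    """Set a pixel if it or any of its 4 neighbours is foreground."""
    return [[int(any(_neighbourhood(mask, i, j))) for j in range(len(mask[0]))]
            for i in range(len(mask))]


def contour_band(mask):
    """Band around the boundary: dilation XOR erosion of the mask."""
    eroded, dilated = erode(mask), dilate(mask)
    return [[d ^ e for d, e in zip(d_row, e_row)]
            for d_row, e_row in zip(dilated, eroded)]


def dice(a, b):
    """Dice overlap of two binary masks; two empty masks score 1.0."""
    intersection = sum(x & y for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    total = sum(map(sum, a)) + sum(map(sum, b))
    return 2 * intersection / total if total else 1.0


def contour_band_dice(ground_truth, prediction):
    """Dice overlap restricted to the contour bands of both masks."""
    return dice(contour_band(ground_truth), contour_band(prediction))


# One 5x5 slice with a 3x3 square structure in the centre.
square = [[0, 0, 0, 0, 0],
          [0, 1, 1, 1, 0],
          [0, 1, 1, 1, 0],
          [0, 1, 1, 1, 0],
          [0, 0, 0, 0, 0]]
empty = [[0] * 5 for _ in range(5)]
band_size = sum(map(sum, contour_band(square)))
```

A trainable loss would need a differentiable version operating on soft predictions; the sketch only shows how the three operators isolate the boundary region.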

Partial annotations for the segmentation of large structures with low annotation cost

Bella Specktor Fadida , Daphna Link Sourani , Liat Ben Sira , Elka Miller , Dafna Ben Bashat , Leo Joskowicz

Category: Computer Vision | Machine Learning

2022-09-25

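Deep learning methods have been shown to be effective for the automatic segmentation of structures and pathologies in medical imaging. However, they require large annotated datasets, whose manual segmentation is a tedious and time-consuming task, especially for large structures. We present a new method of partial annotations that uses a small set of consecutive annotated slices from each scan, with an annotation effort equal to that of only a few fully annotated cases. Training with partial annotations is performed by using only the annotated blocks, incorporating information about slices outside the structure of interest, and modifying the batch loss function to consider only the annotated slices. To facilitate training in a low-data regime, we use a two-step optimization process. We tested the method with the popular soft Dice loss on two MRI sequences, TRUFI and FIESTA, and compared the full annotation regime to partial annotations with a similar annotation effort. For the TRUFI data, partial annotations yielded slightly better performance on average than full annotations, with the Dice score increasing from 0.936 to 0.942, and a substantial decrease in the standard deviation (STD) of the Dice score by 22% and of the Average Symmetric Surface Distance (ASSD) by 15%. For the FIESTA sequence, partial annotations also decreased the STD of the Dice score and of the ASSD by 27.5% and 33%, respectively, on in-distribution data, and substantially improved average performance on out-of-distribution data, increasing the Dice score from 0.84 to 0.9 and decreasing the ASSD from 7.46 to 4.01 mm. The two-step optimization process was helpful for partial annotations on both in-distribution and out-of-distribution data. The partial annotations method with the two-step optimizer is therefore recommended for improving segmentation performance in a low-data regime.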
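
The idea of restricting the batch loss to annotated slices can be sketched as a masked soft Dice loss. This is a minimal pure-Python illustration under stated assumptions: each slice is a flat list of foreground probabilities, `annotated` is a per-slice flag, and the smoothing constant `eps` and the function name are hypothetical. The paper's actual loss and batching may differ.

```python
def masked_soft_dice_loss(probs, targets, annotated, eps=1e-6):
    """Soft Dice loss accumulated over annotated slices only.

    probs:     per-slice lists of predicted foreground probabilities
    targets:   per-slice lists of ground-truth labels (0 or 1)
    annotated: per-slice booleans; False slices are ignored entirely
    """
    intersection = prob_sum = target_sum = 0.0
    for p_slice, t_slice, keep in zip(probs, targets, annotated):
        if not keep:
            # Unannotated slice: its all-zero target is a placeholder,
            # so it must not contribute to the loss.
            continue
        intersection += sum(p * t for p, t in zip(p_slice, t_slice))
        prob_sum += sum(p_slice)
        target_sum += sum(t_slice)
    return 1.0 - (2.0 * intersection + eps) / (prob_sum + target_sum + eps)


# Three slices of two pixels each; the middle slice has no annotation.
probs = [[1.0, 0.0], [0.9, 0.9], [1.0, 1.0]]
targets = [[1, 0], [0, 0], [1, 1]]
loss_masked = masked_soft_dice_loss(probs, targets, [True, False, True])
loss_unmasked = masked_soft_dice_loss(probs, targets, [True, True, True])
```

Without the mask, the confident predictions on the unannotated middle slice are penalised against its placeholder zeros, which is the failure mode that masking avoids.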

Object Detection Using Sim2Real Domain Randomization for Robotic Applications

Dániel Horváth , Gábor Erdős , Zoltán Istenes , Tomáš Horváth , Sándor Földi

Category: Robotics | Computer Vision

2022-08-08

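Robots working in unstructured environments must be capable of sensing and interpreting their surroundings. One of the main obstacles for deep-learning-based models in robotics is the lack of domain-specific labeled data for different industrial applications. In this paper, we propose a sim2real transfer learning method based on domain randomization for object detection, with which labeled synthetic datasets of arbitrary size and object types can be generated automatically. Subsequently, a state-of-the-art convolutional neural network, YOLOv4, is trained to detect the different types of industrial objects. With the proposed domain randomization method, we could shrink the reality gap, achieving mAP50 scores of 86.32% and 97.38% in the zero-shot and one-shot transfer cases, respectively, on a dataset containing 190 real images. On a GeForce RTX 2080 Ti GPU, the data generation process takes less than 0.5 s per image and training lasts around 12 h, which makes the method convenient for industrial use. Our solution fits industrial needs, as it can reliably differentiate similar classes of objects using only one real image for training. To the best of our knowledge, this is the only work thus far satisfying these constraints.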
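
The mAP50 metric counts a detection as correct when its intersection over union (IoU) with a ground-truth box is at least 0.5. A minimal sketch of that matching criterion follows; it assumes axis-aligned boxes given as (x1, y1, x2, y2), and the function names are hypothetical. The full metric additionally averages precision over recall levels and classes, which is not shown here.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    inter_w = min(box_a[2], box_b[2]) - max(box_a[0], box_b[0])
    inter_h = min(box_a[3], box_b[3]) - max(box_a[1], box_b[1])
    # Non-overlapping boxes give a negative width or height.
    intersection = max(0, inter_w) * max(0, inter_h)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def matches_at_50(predicted_box, ground_truth_box):
    """True if the detection would count as correct at the 0.5 IoU threshold."""
    return iou(predicted_box, ground_truth_box) >= 0.5


ground_truth = (0, 0, 10, 10)
half_height = (0, 0, 10, 5)   # covers the lower half of the ground truth
shifted = (5, 5, 15, 15)      # overlaps only one quarter
```

With these boxes, `half_height` reaches an IoU of exactly 0.5 and is accepted, while `shifted` reaches 25 / 175 and is rejected.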
